Comparison of Time-Frequency Feature Extraction Techniques for Environmental Sound Recognition

نویسندگان

  • MICHAEL COWLING
  • RENATE SITTE
چکیده

This paper is the continuation of previously published work in which we have been analysing different methods – traditionally used in speech recognition – for their suitability to be applied to Environmental Sound Recognition. While current research devotes much effort to speech and speaker recognition, Environmental Sound Recognition is an area where little research has been reported. Despite this, environmental sound recognition is important for areas such as surveillance, because microphones need to be less focused than a video surveillance camera. This paper discusses a combinatorial experiment that investigates the use of time-frequency feature extraction techniques such as STFT and Wavelets, combined with speech recognition system learning techniques (such as the AI techniques of LVQ and ANN) for the classification of non-speech environmental sounds. This experiment reveals that a combination of a continuous wavelet transform with dynamic time warping produces the best results for environmental sound recognition. This performance is superseded only by performance on speech by Hidden Markov Models, which unfortunately are unsuitable for our purpose. Key-Words: non-speech sound recognition, environmental sound recognition, auditory signal processing, acoustic signal processing, joint time-frequency feature extraction

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comparison Between Different Methods of Feature Extraction in BCI Systems Based on SSVEP

‎There are different feature extraction methods in brain-computer interfaces (BCI) based on Steady-State Visually Evoked Potentials (SSVEP) systems‎. ‎This paper presents a comparison of five methods for stimulation frequency detection in SSVEP-based BCI systems‎. ‎The techniques are based on Power Spectrum Density Analysis (PSDA)‎, ‎Fast Fourier Transform (FFT)‎, ‎Hilbert‎- ‎Huang Transform (H...

متن کامل

Supervised Feature Extraction of Face Images for Improvement of Recognition Accuracy

Dimensionality reduction methods transform or select a low dimensional feature space to efficiently represent the original high dimensional feature space of data. Feature reduction techniques are an important step in many pattern recognition problems in different fields especially in analyzing of high dimensional data. Hyperspectral images are acquired by remote sensors and human face images ar...

متن کامل

Noise-Robust environmental sound classification method based on combination of ICA and MP features

This paper presents an environmental sound classification method that is noise-robust against sounds recorded by mobile devices, and presents evaluation of its performance. This method is specifically designed to recognize higher semantics of context from environmental sound. Conventionally, sound classifications have used acoustic features in the frequency domain extracted from sound data usin...

متن کامل

Using Spectro-Temporal Features for Environmental Sounds Recognition

The paper presents the task of recognizing environmental sounds for audio surveillance and security applications. A various characteristics have been proposed for audio classification, including the popular Mel-frequency cepstral coefficients (MFCCs) which give a description of the audio spectral shape. However, it exist some temporal-domain features. These last have been developed to character...

متن کامل

A Real-Time Electroencephalography Classification in Emotion Assessment Based on Synthetic Statistical-Frequency Feature Extraction and Feature Selection

Purpose: To assess three main emotions (happy, sad and calm) by various classifiers, using appropriate feature extraction and feature selection. Materials and Methods: In this study a combination of Power Spectral Density and a series of statistical features are proposed as statistical-frequency features. Next, a feature selection method from pattern recognition (PR) Tools is presented to e...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002